Grammar fragment acquisition using syntactic and semantic clustering

نویسندگان

  • Kazuhiro Arai
  • Jeremy H. Wright
  • Giuseppe Riccardi
  • Allen L. Gorin
چکیده

A new method for automatically acquiring grammar fragments for understanding uently spoken language is proposed. The goal of this method is to generate a collection of grammar fragments each representing a set of syntactically and semantically similar phrases. First phrases observed frequently in the training set are selected as candidates. Each candidate phrase has three associated probability distributions: of succeeding contexts, of preceding contexts, and of associated machine actions. The similarity between candidate phrases is measured by applying the Kullback-Leibler distance to three probability distributions. Candidate phrases which are close in all three distances are clustered into a grammar fragment. This approach detected 246 phrases in the test-set that were not present in the training-set. Experimental results show that a 3% improvement in the call-type classi cation performance has been achieved by introducing these fragments. key words spoken understanding, preceding and succeeding contexts, Kullback-Leibler distance, phrase similarity, phrase clustering

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Feature Engineering in Persian Dependency Parser

Dependency parser is one of the most important fundamental tools in the natural language processing, which extracts structure of sentences and determines the relations between words based on the dependency grammar. The dependency parser is proper for free order languages, such as Persian. In this paper, data-driven dependency parser has been developed with the help of phrase-structure parser fo...

متن کامل

Acquisition of English Prenominal and Postnominal Genitives

This study examined the acquisition of prenominal and postnominal genitives by Iranian EFL learners. Two variables were considered: possessive categories and language proficiency. We considered the influence of possessive categories such as lexical modifier, semantic relationship, and weight and syntactic complexity on genitive alternations by Iranian EFL learners. Also, we examined whether the...

متن کامل

Emergent Functional Grammar for Space

This chapter explores a semantics-oriented approach to the origins of syntactic structure. It reports on preliminary experiments whereby speakers introduce hierarchical constructions and grammatical markers to express which conceptualization strategy hearers are supposed to invoke. This grammatical information helps hearers to avoid semantic ambiguity or errors in interpretation. A simulation s...

متن کامل

Semiautomatic Acquisition of Semantic Structures for Understanding Domain-Specific Natural Language Queries

ÐThis paper describes a methodology for semiautomatic grammar induction from unannotated corpora of information-seeking queries in a restricted domain. The grammar contains both semantic and syntactic structures, which are conducive to (spoken) natural language understanding. Our work aims to ameliorate the reliance of grammar development on expert handcrafting or on the availability of annotat...

متن کامل

Improving Verb Clustering with Automatically Acquired Selectional Preferences

In previous research in automatic verb classification, syntactic features have proved the most useful features, although manual classifications rely heavily on semantic features. We show, in contrast with previous work, that considerable additional improvement can be obtained by using semantic features in automatic classification: verb selectional preferences acquired from corpus data using a f...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Speech Communication

دوره 27  شماره 

صفحات  -

تاریخ انتشار 1998